A simple, effective distance and density based outlier detection algorithm
نویسندگان
چکیده
Outliers are eccentric data points with anomalous nature. Clustering outliers has received a lot of attention in the processing community. But, they inordinately affect quality results obtained case popular clustering algorithms during process finding an optimal solution. In this work, we propose novel method to classify grouping characteristics as either outlier or not. We use both distance and density particular point respect rest for process. Distances used find at extremities while densities identify sparsest spaces. Further, every model take into account aspect generalization order work robustly even out box situations. Hence, our approach provides model. The accuracy proposed is measured using area under curve (AUC) was found highest cardioto set -AUC value-0.90 second AUC value Spambase -0.52 several other datasets demonstrate usage proposed.
منابع مشابه
A simple and effective outlier detection algorithm for categorical data
Outlier detection is an important data mining task that has attracted substantial attention within diverse research communities and the areas of application. By now, many techniques have been developed to detect outliers. However, most existing research focus on numerical data. And they can not directly apply to categorical data because of the difficulty of defining a meaningful similarity meas...
متن کاملAn Effective Genetic Algorithm for Outlier Detection
The main objective the outlier detection is to find the data that are exceptional from other data in the data set. Detection of such exceptional data’s is an important issue in many fields like fraud detection, Intrusion detection and Medicine . In this paper we are proposing an algorithm to detect outliers using genetic algorithm. The proposed method was exceptionally accurate in identifying t...
متن کاملAn Effective Genetic Algorithm for Outlier Detection
The main objective the outlier detection is to find the data that are exceptional from other data in the data set. Detection of such exceptional data’s is an important issue in many fields like fraud detection, Intrusion detection and Medicine . In this paper we are proposing an algorithm to detect outliers using genetic algorithm. The proposed method was exceptionally accurate in identifying t...
متن کاملRapid Distance-Based Outlier Detection via Sampling
Distance-based approaches to outlier detection are popular in data mining, as they do not require to model the underlying probability distribution, which is particularly challenging for high-dimensional data. We present an empirical comparison of various approaches to distance-based outlier detection across a large number of datasets. We report the surprising observation that a simple, sampling...
متن کاملDistance-based Outlier Detection in Data Streams
Continuous outlier detection in data streams has important applications in fraud detection, network security, and public health. The arrival and departure of data objects in a streaming manner impose new challenges for outlier detection algorithms, especially in time and space efficiency. In the past decade, several studies have been performed to address the problem of distance-based outlier de...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Indonesian Journal of Electrical Engineering and Computer Science
سال: 2021
ISSN: ['2502-4752', '2502-4760']
DOI: https://doi.org/10.11591/ijeecs.v24.i2.pp1141-1148